Modified Bacillus thuringiensis insecticidal-crystal protein genes and their expression in plant cells

ABSTRACT

A DNA fragment, encoding all or an insecticidally-effective part of a Bt crystal protein, is modified by changing A and T sequences to corresponding G and C sequences encoding the same amino acids.

This application is a continuation of application Ser. No. 07/937,869, filed as PCT/EP91/00733, Apr. 17, 1991 published as WO91/16432, Oct. 13, 1991, now abandoned.

This invention provides a modified Bacillus thuringiensis ("Bt") gene (the "modified BtICP gene") encoding all or an insecticidally-effective portion of a Bt insecticidal crystal protein ("ICP"). A plant, transformed with the modified Bt ICP gene can show higher expression levels of the encoded ICP and improved insect-resistance.

BACKGROUND OF THE INVENTION

Plant genetic engineering technology has made significant progress during the last 10 years. It has become possible to introduce stably foreign genes into plants. This has provided exciting opportunities for modern agriculture. Derivatives of the Ti-plasmid of the plant pathogen, Agrobacterium tumefaciens, have proven to be efficient and highly versatile vehicles for the introduction of foreign genes into plants and plant cells. In addition, a variety of free DNA delivery methods, such as electropotation, microinjection, pollen-mediated gene transfer and particle gun technology, have been developed for the same purpose.

The major aim of plant transformations by genetic engineering has been crop improvement. In an initial phase, research has been focused on the engineering into plants of useful traits such as insect-resistance. In this respect, progress in engineering insect resistance in transgenic plants has been obtained through the use of genes, encoding ICPs, from Bt strains (Vaeck et al., 1987). A Bt strain is a spore forming gram-positive bacterium that produces a parasporal crystal which is composed of crystal proteins which are specifically toxic against insect larvae. Bt ICPs possess a specific insecticidal spectrum and display no toxicity towards other animals and humans (Gasser and Fraley, 1989). Therefore, the Bt ICP genes are highly suited for plant engineering purposes.

For more than 20 years, Bt crystal spore preparations have been used as biological insecticides. The commercial use of Bt sprays has however been limited by high production costs and the instability of crystal proteins when exposed in the field (Vaeck et al., 1987). The heterogeneity of Bt strains has been well documented. Strains active against Lepidoptera (Dulmage et al., 1981), Diptera (Goldberg and Margalit, 1977) and Coleoptera (Krieg et al., 1983) have been described.

Bt strains produce endogenous crystals upon sporulation. Upon ingestion by insect larvae, the crystals are solubilized in the alkaline environment of the insect midgut giving rise to a protoxin which is subsequently proteolytically converted into a toxic core fragment or toxin of 60-70 kDa. The toxin causes cytolysis of the epithelial midgut cells. The specificity of Bt ICPs can be determined by their interaction with high-affinity binding sites present on insects' midgut epithelia.

The identification of Bt ICPs and the cloning and sequencing of Bt ICP genes has been reviewed by H ofte and Whiteley (1989). The Bt ICP genes share a number of common properties. They generally encode insecticidal proteins of 130 kDa to 140 kDa or of about 70 kDa, which contain toxic fragments of 60±10 kDa (H ofte and Whiteley, 1989). The Bt ICP genes have been classified into four major groups according to both their structural similarities and insecticidal spectra (H ofte and Whiteley, 1989): Lepiaoptera-specific (CryI), Lepidoptera- and Diptera-specific (CryII), Coleoptera-specific (CryIII) and Diptera-specific (Cry IV) genes. The Lepidoptera-specific genes (CryI) all encode 130-140 kDa proteins. These proteins are generally synthesized as protoxins. The toxic domain is localized in the N-terminal half of the protoxin. Deletion analysis of several CryI genes confirm that 3' portions of the protoxins are not absolutely required for toxic activity (Schnepf etal., 1989). Cry II genes encode 65 kDa proteins (Widner and Whiteley, 1985). The Cry II A proteins are toxic against both Lepidoptera and Diptera while the Cry II B proteins are toxic only to Lepidopteran insects. The Coleoptera-specific genes (Cry III) generally encode proteins with a molecular weight of about 70 kDa. (Whiteley and H ofte, 1989). The corresponding gene (cry III A) expressed in E. coli directs the synthesis of a 72 kDa protein which is toxic for the Colorado potato beetle. This 72 kDa protein is processed to a 66 kDa protein by spore-associated bacterial proteases which remove the first 57 N-terminal amino acids (Mc Pherson et al., 1988). Deletion analysis demonstrated that this type of gene cannot be truncated at its 3'-end without the loss of toxic activity (H ofte and Whiteley, 1989). Recently, an anti-coleopteran strain, which produces a 130 kDa, protein has also been described (European patent application ("EPA") 89400428.2). The cry IV class of crystal protein genes is composed of a heterogenous group of Diptera-specific crystal protein genes (H ofte and Whiteley, 1989).

The feasibility of generating insect-resistant transgenic crops by using Bt ICPs has been demonstrated. (Vaeck et al., 1987; Fischoff et al., 1987 and Barton et al., 1987). Transgenic plants offer an attractive alternative and provide an entirely new approach to insect control in agriculture which is at the same time safe, environmentally attractive and cost-effective. (Meeusen and Warren, 1989). Successful insect control has been observed under field conditions (Delannay et al., 1989; Meeusen and Warren, 1989).

In all cases, Agrobacterium-mediated gene transfer has been used to express chimaeric Bt ICP genes in plants (Vaeck et al., 1987; Barton et al., 1987; Fischoff et al., 1987). Bt ICP genes were placed under the control of a strong promoter capable of directing gene expression in plant cells. It is however remarkable that expression levels in plant cells were high enough only to obtain insect-killing levels of Bt ICP genes when truncated genes were used (Vaeck et al., 1987; Barton et al., 1987). None of the transgenic plants containing a full-length Bt ICP gene produced insect-killing activity. Moreover, Barton et al. (1987) showed that tobacco calli transformed with the entire Bt ICP coding sequence became necrotic and died. These results indicate that the Bt ICP gene presents unusual problems that must be overcome to obtain significant levels of expression in plants. Even, when using a truncated Bt ICP gene for plant transformation, the steady state levels of Bt ICP mRNA obtained in transgenic plants are very low relative to levels produced by both an adjacent NPT II-gene, used as a marker, and by other chimetic genes (Barton et al., 1987; Vaeck et al., 1987). Moreover, the Bt ICP mRNA cannot be detected by northern blot analysis. Similar observations were made by Fischoff et al. (1987); they reported that the level of Bt ICP mRNA was much lower than expected for a chimeric gene expressed from the CaMV35S promoter. In other words, the cytoplasmic accumulation of the bt mRNA, and consequently the synthesis, the accumulation and thereby the expression of the Bt ICP protein in plant cells, are extremely inefficient. By contrast, in microorganisms, it has been shown that truncated Bt ICP genes are less favorable than full-length genes (Adang et al., 1985), indicating that the inefficient expression is solely related to the heterologous expression of Bt ICP genes in plants.

The problem of obtaining significant Bt ICP expression levels in plant cells seems to be inherent and intrinsic to the Bt ICP genes. Furthermore, the relatively low and poor expression levels obtained in plants appears to be a common phenomenon for all Bt ICP genes.

It is known that there are six steps at which gene expression can be controlled in eucaryotes (Darnell, 1982):

1) Transcriptional control

2) RNA processing control

3) RNA transport control

4) mRNA degradation control

5) translational control

6) protein activity control

For all genes, transcriptional control is considered to be of paramount importance (The Molecular Biology of the Cell, 1989).

In European patent publications ("EP") 385,962 and 359,472, efforts to modify the codon usage of Bt ICP genes to improve their expression in plant cells have been reported. However, wholesale (i.e., non-selective) changes in codon usage can introduce cryptic regulatory signals in a gene, thereby causing problems in one or more of the six steps mentioned above for gene expression, and thus inhibiting or interfering with transcription and/or translation of the modified foreign gene in plant cells. For example, changes in codon usage can cause differential rates of mRNA production, producing instability in the mRNA, so produced (e.g., by exposure of regions of the mRNA, unprotected by ribosomes, to attack and degradation by cytoplasmic enzymes). Changes in codon usage also can inadvertantly cause inhibition or termination of RNA polymerass II elongation on the so-modified gene.

SUMMARY OF THE INVENTION

In accordance with this invention is provided a process for modifying a foreign gene, particularly a Bt ICP gene, whose level and/or rate of expression in plant cells, transformed with the gene, is limited by the rate and/or level of nuclear production of an mRNA encoded by the gene; the process comprises the step of changing adenine and thymine sequences to corresponding guanine and cytosine sequences encoding the same amino acids in a plurality of translational codons of the gene that would otherwise directly or indirectly cause a nuclear event which would negatively control (i.e., inhibit or interfere with) transcription, nuclear accumulation and/or nuclear export of the mRNA, particularly transcription, quite particularly elongation of transcription by RNA polymerass II of the plant cells. Preferably, the adenins and thymine sequences are changed to cytosine and guanine sequences in translational codons of at least one region of the gene which, during transcription, would otherwise have thereon a relatively low percentage of RNA polymerase II as compared to another adjacent upstream (i.e., 5') region of the gene.

Also in accordance with this invention is provided the modified Bt ICP gene resulting from the process.

Further in accordance with this invention, a process is provided for improving the resistance of a plant against insect pests by transforming the plant cell genome with at least one modified Bt ICP gene.

This invention also relates to a chimaeric gene that can be used to transform plant cells and that contains the following operably-linked DNA fragments in the same transcriptional unit:

1) the modified Bt ICP gene;

2) a promoter suitable for directing transcription of the modified Bt ICP gene in the plant cells; and

3) suitable transcript 3' end formation and polyadenylation signals for expressing the modified Bt ICP gene in the plant cells.

This invention further relates to:

a cell of a plant, the nuclear genome of which has been transformed to contain, preferably stably integrated therein, the modified Bt ICP gene, particularly the chimaeric gene;

cell cultures consisting of the plant cell;

a plant which is regenerated from the transformed plant cell or is produced from the so-regenerated plant, the genome of which contains the modified Bt ICP gene, particularly the chimaeric gene, and which shows improved resistance to insect pests;

seeds of the plant; and

a vector for stably transforming the nuclear genome of plant cells with the modified Bt ICP gene, particularly the chimaeric gene.

DETAILED DESCRIPTION OF THE INVENTION

As used herein, "Bt ICP" should be understood as an intact protein or a part thereof which has insecticidal activity and which can be produced in natureby B. thuringiensis. A Bt ICP can be a protoxin, as well as an active toxin or other insecticidal truncated part of a protoxin which need not be crystalline and which need not be a naturally occurring protein. An example of a Bt ICP is a Bt2 insecticidal crystal protein (H ofte et al., 1986), as well as its insecticidally effective parts which are truncated at its C- and/or N-terminal ends towards its tryspsin cleavage site(s) and preferably having a molecular weight of 60-80 kDa. Other examples of Bt ICPs are: Bt2, Bt3, Bt4, Bt13, Bt14, Bt15, Bt18, Bt21, Bt22, Bt73, Bt208, Bt245, BtI260 and BtI109P as disclosed in PCT publications WO90/15139 and W090/09445, in H ofte and Whiteley (1989) and in EPA 90403724.9.

As used herein, "protoxin" should be understood as the primary translation product of a full-length gene encoding a Bt ICP.

As used herein, "toxin" or "active toxin" or "toxic core" should all be understood as a part of a protoxin which can be obtained by protease (e.g., by trypsin) cleavage and has insecticidal activity.

As used herein, "truncated Bt gene" should be understood as a fragment of a full-length Bt gene which still encodes at least the toxic part of the Bt ICP, preferentially the toxin.

As used herein, "modified Bt ICP gene" should be understood as a DNA sequence which encodes a Bt ICP, and in which the content of adenins ("A") and thymine ("T") has been changed to guanine ("U") and cytosine ("C") in codons, preferably at least 3, in at least one region of the DNA sequence without affecting the original amino acid sequence of the Bt ICP. Preferably in at least two regions, especially in at least three regions, of the DNA sequence, the A and T content is changed to G and C in at least 3 codons. For regions downstream of the translation initiation site of the DNA sequence, it is preferred that the A-T content of at least about 10 codons, particularly at least about 33 codons, be changed to G-C.

By "region" of a modified Bt ICP gene is meant any sequence encoding at least three translational codons which affect expression of the gene in plants.

In accordance with this invention, it has been shown by means of mRNA turn-over studies that the expression pathway of a Bt ICP gene, such as bt2, bt14, bt15 and bt18, is specifically inhibited at the nuclear level in plant cells. In a further analysis, nuclei of transgenic tobacco plants, i.e., N28-220 (Vaeck et al., 1987), were used in a nuclear run-on assay to determine the distribution and the relative efficiency of RNA polymerass II complexes to initiate transcription of chimaeric Bt ICP plant genes. In this regard, the run-on assay has been used to determine initially the relative efficiency of RNA polymerase II complexes to initiate transcription of Bt ICP genes and thereafter to determine the relative distribution and migration efficiency of the RNA polymerase II complexes on the Bt ICP genes.

N28-220 contains the bt884 fragment under control of the TR 2' promoter as a chimaeric gene. Bt884 is a 5' fragment of the bt2 gene (H ofte et al., 1986) up to codon 610 (Vaeck et al., 1987). Using nuclear run-on analysis, isolated nuclei of N28-220 were incubated with highly labeled radioactive RNA precursors, so that the RNA transcripts being synthesized at the time became radioactively labeled. The RNA polymerase II molecules caught in the act of transcription in the cell continue elongating the same RNA molecules in vitro.

The nuclear run-on assays of nuclei of N28-220 culture (non-induced cells and induced cells, TRl'-neo, TR2'-bt884) revealed that transcription from the TR1' and TR2' promoters is about equally efficient. This implies that the low Bt ICP (i.e., Bt884) expression levels are not due to a specifically reduced transcriptional activity of the TR2' promoter. However, nuclear run-on analysis with N28-220 nuclei indicated that transcription elongation of the nascent Bt ICP mRNA is impaired somewhere between 700 to 1000 nucleotides downstream of the start of transcription. This means that RNA polymerass II is not able to transcribe the Bt ICP coding sequence with 100% efficiency. Filter binding assays using labeled Bt DNA fragments spanning this region and protein extract prepared from tobacco nuclei reveal that this DNA region undergoes specific interactions with proteins present in nuclei. These interactions are the prime candidates that cause or affect the impaired elongation of transcription by RNA polymerase II through this region. By modification of this region to abolish specific protein binding, Bt ICP expression levels will increase. However, other mechanisms responsible for impaired elongation in this region cannot be excluded.

Further in accordance with this invention, sequences within the coding region involved in negative control of cytoplasmic Bt ICP mRNA levels have been identified by deletion analysis. To this end, 24 deletion derivatives of pVE36 have been constructed. Three main types of deletion mutants have been constructed (see FIG. 3):

5' end deletions

3' end deletions

internal deletions.

The expression of a mutant hybrid bt2-neo gene (encoding a fusion protein of Bt2 (H ofte et al., 1986) and NPTII) has been studied by means of transient expression experiments using the cat gene as a reference. To this end, the neo mRNA levels were measured in relation to cat mRNA levels in RNA extracts of SR1 protoplasts. The ratio between the neo and cat mRNA level was used to quantify on a relative basis the nptII transcript (i.e., mRNA) levels produced by the different constructions. These experiments show that progressive deletions of the carboxy-terminal (i.e., 3') part or the amino-terminal (i.e., 5') part of the Bt ICP coding sequence result in a gradual increase of the nptII transcript level. Furthermore, since the changes in transcript levels are not very abrupt, these results suggest that the low transcript levels produced by Bt ICP genes are not controlled by a single factor. Nevertheless, individual modifications of bt2 coding sequence can significantly reduce the interference and/or inhibition of the expression of the mRNA encoded by Bt ICP genes in plant cells at the level of transcript elongation, nuclear accumulation and nuclear export. The modification(s) may also affect cytoplasmic regulation and metabolism of such mRNAs and their translation.

Deletion analysis clearly indicates that several internal sequences, located within the Bt ICP coding region, might be involved in the negative regulation of the Bt ICP expression. By way of example, a 326 bp region (FIG. 6b) was identified in the bt2 gene that is involved in the negative control of BT ICP expression and that is located between nucleotide position 674 and nucleotide position 1000, particularly a 268 bp region between nucleotide positions 733 and 1000, quite particularly a 29 bp region between nucleotide positions 765 and 794 which carries two perfect CCAAT boxes which are known to be able to cause a reduction in elongation efficiency and termination of transcription by RNA polymerass II in animal systems (Connelly and Manley, 1989). This internal gene fragment or inhibitory tone may itself comprise a plurality of inhibitory tones which reduce Bt ICP expression levels or which interact directly or indirectly with other zones to inhibit or interfere with expression. Codon usage of this inhibitory zone has been modified in a second step by substituting A-T with G-C without affecting the amino acid sequence. In this regard, this internal 326 bp fragment (FIG. 6b) has been replaced with a modified Bt ICP fragment of this invention containing 63 modified codons. The effect of such modification of this inhibitory zone on Bt ICP expression has been analyzed both in transient and stable plant transformants. The results show that such modification of codon usage causes a significant increase of Bt ICP expression levels and hence improved insect-resistance.

In addition, N-terminal deletion mutants of the bt2 gene have been made by deleting the first N-terminal 28 amino acids (H ofte et al., 1986). It is known for the bt2 gene that the first 28 codons can be deleted without loss of toxicity (H ofte et al., 1986; Vaeck et al., 1987). Also, codon usage for three codons, 29 to 31, has been changed in accordance with this invention by replacing A-T with G-C without affecting the amino acid sequence. Furthermore, an optimal translation initiation (ATG) site was created based on the consensus sequence of Joshi (1987) as shown in FIG. 6a. Plants transformed with this modified Bt ICP gene show significantly higher Bt ICP expression levels.

In accordance with this invention, all or part of a modified Bt ICP gene of the invention can be stably inserted in a conventional manner into the nuclear genome of a plant cell, and the so-transformed plant cell can be used to produce a transgenic plant showing improved expression of the Bt ICP gene. In this regard, a disarmed Ti-plasmid, containing the modified Bt ICP gene, in Agrobacterium (e.g., A. tumefaciens) can be used to transform a plant cell using the procedures described, for example, in EP 116,718 and EP 270,822, PCT publication 84/02913, EPA 87400544.0 and Gould et al. (1991) (which are incorporated herein by reference). Preferred Ti-plasmid vectors contain the foreign DNA sequence between the border sequence, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example, in EP 233,247), pollen mediated transformation (as described, for example, in EP 270,356, PCT publication WO 85/01856, and U.S. Pat. No. 4,684,611), plant RNA virus-mediated transformation (as described, for example, in EP 67,553 and U.S. Pat. No. 4,407,956), liposome-mediated transformation (as described, for example, in U.S. Pat. No. 4,536,475) and other methods such as the recently described methods for transforming certain lines of corn (Fromm et al., 1990; Gordon-Kamm et al., 1990).

Preferably, the modified Bt ICP gene is inserted in a plant genome downstream of, and under the control of, a promoter which can direct the expression of the gene in the plant cells. Preferred promoters include, but are not limited to, the strong constitutive 35S promoter (Odell et al., 1985) of cauliflower mosaic virus; 35S promoters have been obtained from different isolates (Hull and Howell, Virology 86, 482-493 (1987)). Other preferred promoters include the TR1' promoter and the TR2' promoter (Velten et al., 1984). Alternatively, a promoter can be utilized which is not constitutive but rather is specific for one or more tissues or organs. For example, the modified Bt ICP gene can be selectively expressed in the green tissues of a plant by placing the gene under the control of a light-inducible promoter such as the promoter of the ribulose-1,5-phosphate-carboxylase small subunit gene as described in EPA 86300291.1. Another alternative is to use a promoter whose expression is inducible by temperature or chemical factors.

It is also preferred that the modified Bt ICP gene be inserted upstream of suitable 3' transcription regulation signals (i.e., transcript 3' end formation and polyadenylation signals) such as the 3' untranslated end of the octopine synthase gene (Gielen et al., 1984) or T-DNA gene 7 (Velten and Schell, 1985).

The resulting transformed plant of this invention shows improved expression of the modified Bt ICP gene and hence is characterized by the production of high levels of Bt ICP. Such a plant can be used in a conventional breeding scheme to produce more transformed plants with the same improved insect-resistance characteristics or to introduce the modified Bt ICP gene into other varieties of the same or related plant species. Seeds, which are obtained from the transformed plants, contain the modified BtICP gene as a stable genomic insert.

Furthermore, at least two modified BtICP genes, coding for two non-competitively binding anti-Lepidopteran or anti-Coleopteran Bt ICPs, can be cloned into a plant expression vector (EPA 89401499.2). Plants transformed with such a vector are characterized by the simultaneous expression of at least two modified BtICP genes. The resulting transgenic plant is particularly useful to prevent or delay development of resistance to Bt ICP of insects feeding on the plant.

The following Examples illustrate the invention and are not intended to limit its scope. The Figures, referred to in the Examples, are as follows:

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1--Comparison of the transcription initiation frequency of RNA polymerase II complexes in nuclei of N28-220. Hybridisation efficiencies of labeled nptII mRNA and Bt ICP mRNA with their complementary DNA counterparts present on a Southern blot were compared. DNA fragments were obtained from a digest of plasmid pGSH163. A schematic view of the region is given. The lengths of the fragments blotted on Hybond-N filter (1), the homologous genes on plasmid pGSH163 (2), and the densitometric values (3) are as follows:

    ______________________________________                                         Digest:      1          2       3                                              ______________________________________                                         BamHI/HindIII                                                                               2358       neo     12386                                                       1695       bt2      6565                                                       154        bt2     --                                                          6250       vector  --                                             ______________________________________                                    

FIG. 2a--Determination of the distribution of the RNA polymerase II complexes on the Bt ICP coding sequence in nuclei of N28-220. The hybridisation of labeled RNA prepared by nuclear run on with DNA fragments of the Bt ICP coding sequence was quantitated. The restriction fragments and scanning values are given in the table (FIG. 2A-2) and figure (FIG. 2A-1). The scanning value is proportional to "X", the size of the DNA fragment and the # UTP per RNA fragment hybridising. "X" is directly proportional to the number of RNA polymerases passing through the DNA fragment. "X" is proportional to the scanning value divided by the number of UTPs. The X values of the different restriction fragments are shown in the figure. In this regard, conversion of the different densitometric values into relative hybridisation efficiencies by normalising the values of the number of dATPs present in the DNA fragment, complementary to the hybridising RNA, generates the value "X". "X" is a relative measure of the number and the length of the extension of the transcripts. "X" thus reflects the number of RNA polymerases transcribing a specific DNA sequence and their elongation rate. DNA fragments present on the Southern digests of plasmid DNA of plant vector pGSH163 each have the following lengths of fragments blotted on Hybond-N filter (1), homologous genes on plasmid pGSH163 (2) and densitometric values (3):______________________________________Digest: 1 2 3______________________________________BamHI/EcoRI 8877 neo 15333 726 bt2(2) 2926 583 bt2(3) 635 271 bt2(1) --BamHI/EcoRV 8887 neo 15182 84 bt2 2466 729 bt2 1102BamHI/HindIII 6250 -- -- 2358 neo 12386 1695 bt2 6565 154 -- --BamHI/SacI 8053 neo 14194 1353 bt2(1) 4572 1051 bt2(2) 615XmnI 4973 neo 13219 2107 -- -- 1401 -- -- 729 bt2(3) 736 628 bt2(2) 1817 305 bt2(4) -- 188 bt2(5) -- 120 bt2(1) --______________________________________

FIG. 2b--Schematic view of nine bt884 DNA fragments that were inserted into the polylinker of M13 vectors, MP18 and MP19 (Yanisch-Perron et al., 1985). The Bt ICP coding sequence is shown from AUG to 1600 nucleotides downstream. The relevant restriction sites and sizes of the DNA fragments are indicated. The nucleotide numbering is relative to the AUG. The subclones were named pJD71, pJD72, pJD73, etc. (to pJD79 ), as indicated. The inserts were oriented into the M13 vector such that single standed M13 carried the fragments of the Bt ICP coding sequence in an anti-sense orientation.

FIG. 2c--Schematic representation of three nuclear run-on analyses with N28-220 nuclei as described by Cox and Goldberg (1988). Assays were performed for periods of 5, 10 and 30 minutes. The labeled nuclear RNA was allowed to hybridize with 5 μg of single stranded pJD71-pJD79 and MP18 DNA, which were immobilised on nylon membranes. The membranes were autoradiographed, and densitometric values were obtained by scanning the autoradiographs. The abscissa shows the nucleotide position relative to the AUG of the Bt (i.e., bt2) coding sequence. The center of each of the single stranded Bt DNA fragments is indicated in the graph. The ordinate gives the relative hybridisation signal for each fragment corrected for the number of dATPs in the fragment and adjusted to 100% for the value of pJD71 for each of the three incubation periods. All values are corrected for non-specific hybridisation to single stranded MP18 DNA. The relative values are a measure for the reactivation of bt mRNA synthesis by RNA polymerass II. The assay does not distinguish between the number of mRNA extensions and the length of mRNA extensions.

FIG. 3--Construction of deletion mutants of the bt860-neo gene to measure the effect on cytoplasmic Bt ICP mRNA levels. The parental vector pVE36 (FIG. 3A) is shown. The following deletion mutants were generated:

1. PJD50 (FIG. 3B): pJD50 was derived from pVE36 by digesting with BamHI and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

2. PJD51 (FIG. 3B): pJD51 was derived from pVE36 by digesting with SpeI and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerass I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

3. PJD52 (FIG. 3B): pJD52 was derived from pVE36 by digesting with EcoRV and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

4. PJD53 (FIG. 3B): pJD53 was derived from pVE36 by digesting with XcaI and SphI. The 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

5. PJD54 (FIG. 3B): pJD54 was derived from pVE36 by digesting with AflII and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

6. PJD55 (FIG. 3B): pJD55 was derived from pVE36 by digesting with ClaI and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

7. PJD56 (FIG. 3B): pJD56 was derived from pVE36 by digesting with XhoI and SphI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

8. PJD57 (FIG. 3B): pJD57 was derived from pVE36 by digesting with AflII and BamHI. The 5' and 3' protruding ends were filled in with Klenow DNA polymerass I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

9. PJD58 (FIG. 3B): pJD58 was derived from pVE36 by digesting with XcaI and BamHI. The 5' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

10. PJD59 (FIG. 3B): pJD59 was derived from pVE36 by digesting with EcoRV and BamHI. The 5' protruding ends were filled in with Klenow DNA polymerass I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

11. PJD60 (FIG. 3B): pJD60 was derived from pVE36 by digesting with SpeI and BamHI. The 5' protruding ends were filled in with Klenow DNA polymerase I enzyme. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

12. PJD61 (FIG. 3C): PJD61 was derived from PJD50. PVE36 was digested with XbaI and filled in with Klenow polymerase I. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 375 bp XbaI fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

13. PJD62 (FIG. 3C): PJD62 was derived from PJD50. PVE36 was digested with XcaI and EcoRV. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 367 bp XcaI-EcoRV fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

14. PJD63 (FIG. 3C): PJD63 was derived from PJD50. PVE36 was digested with XcaI and EcoRV. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 474 bp XcaI-EcoRV fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

15. PJD64 (FIG. 3C): PJD64 was derived from PJD50. PVE36 was digested with EcoRI aM EcoRV and filled in with Klenow polymerase I. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 458 bp EcoRI-EcoRV fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

16. PJD65 (FIG. 3C): PJD65 was derived from PJD50. PVE36 was digested with EcoRI and XbaI and filled in with Klenow polymerase I. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 327 bp EcoRI-XbaI fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformante were selected for amp^(r) phenotype.

17. PJD66 (FIG. 3C): PJD66 was derived from PJD50. PVE36 was digested with SpeI and XcaI and filled in with Klenow polymerase I. PJD50 was linearized with BamHI and filled in with Klenow polymerase I. The 1021 bp SpeI-XcaI fragment of PVE36 was ligated in the filled in BamHI of pJD50. The ligation mixture was used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

18.PPS56D1 (FIG. 3D): PPS56D1 was derived from PJD56 by digesting with EcoRV. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

19. PPS56D2 (FIG. 3D): PPS56D2 was derived from PJD56 by digesting with XcaI and AflII. The 5' protruding ends were filled in with Klenow polymerase I. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

20. PPS56D3 (FIG. 3D): PPS56D3 was derived from PJD56 by digesting with SpeI and EcoRV. The 5' protruding ends were filled in with Klenow polymerase I. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

21. PPS56D4 (FIG. 3D): PPS56D4 was derived from PJD56 by digesting with XcaI and partially with EcoRV. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

22. PPS56D6 (FIG. 3D): PPS56D6 was derived from PJD56 by digesting with SpeI and partially with EcoRV. The 5' protruding ends were filled in with Klenow polymerase I. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

23. PPS56D7 (FIG. 3D): PPS56D7 was derived from PJD56 by digesting with SpeI and XcaI. The 5' protruding ends were filled in with Klenow polymerase I. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

24. PPS56D8 (FIG. 3D): PPS56D8 was derived from PPS56D2 by digesting with SpeI and partially with EcoRV. The 5' protruding ends were filled in with Klenow polymerase I. The treated DNA was ligated and then used to transform MC1061 cells. Transformants were selected for amp^(r) phenotype.

FIG. 4--Effect of deletions in the Bt ICP coding sequence on cytoplasmic Bt ICP mRNA levels. The cytoplasmic mRNA levels specified by the invariable cat reference gene and the different Bt ICP deletion mutants described in FIG. 3 are listed in the table. The measurements were converted into relative Bt ICP mRNA abundances. Bt ICP and cat mRNA quantizations were done as described by Cornelissen (1989). Total RNA was slot blotted and hybridised with radioactively labeled RNA complementary to the neo and cat coding sequences. Values were quantitated with the aid of calibration curves of cold cat and Bt ICP riboprobe transcripts.

FIG. 5--Relative transcript levels produced by the deletion derivatives of pVE36.

FIG. 6a [SEQ ID NOS:7-10]--Schematic presentation of the synthetic DNA sequences used to introduce a N-terminal deletion and a change of the codons 29, 30 and 31 of the bt2 coding sequence. The oligo nucleotides [SEQ ID NOS.11-20] were annealed according to Engler et al. (1988) and cloned into the BstXI restriction site of plasmid pVE36, yielding pPSO27. The 7360 bp fragment of pPSO27 was ligated to the the 1177 bp ClaI restriction fragment of pVE36, yielding plasmid pPSO28. pPSO28 is identical to pVE36 apart for the N-terminal modification.

FIG. 6b [SEQ ID NO:21]--Schematic presentation of the synthetic DNA sequences used to introduce an internal modification into the bt2 coding sequence. The oligonucleotides (FIG. 68-1) were annealed and ligated (FIG. 68-2) as described by Engler et al. (1988) and the resulting concatemeric DNA fragment was cut with the restriction enzymes XbaI and EcoRI to release the modified 327 bp XbaI-EcoRI restriction fragment. This fragment was ligated into the 3530 bp EcoRI-XbaI fragment of pPS023 which is a pUC19 derivative (Yanisch-Perron et al., 1985) that carries the 1533 bp AflII (filled in) BamHI fragment of pVE36 in the HindIII (filled in) BamHI site of pUC19, resulting in plasmid pPSO24. Plasmid pPSO24 was linearised by digestion with restriction enzyme XbaI and the 375 bp XbaI restriction fragment of pPS023 was introduced resulting in pPS025. The 1177 bp ClaI fragment of pPSO25 was introduced in the 7360 bp ClaI restriction fragment of pPS027 yielding pPS029. pPS029 is identical to pVE36 but carries both the amino-terminal modification and the internal modification of the Bt ICP coding sequence.

FIG. 6c (FIG. 6C-1)--Nucleotide sequences 800 to 4000 of the plasmids pVE36 [SEQ ID NO:22] and (FIG. CC-2) pPS029 [SEQ ID NO.23]. "x" refers to not known nucleotides.

FIG. 7--Schematic presentation of the effect of the mutations on the AT content of the Bt ICP plant gene. (pVE36: FIG. 7A; pPS029: FIG. 7B). The modified regions are indicated.

FIGS. 8A-1 and -2--Schematic presentation of the plasmid constructions used in the transient expression assay. The relevant genes are indicated.

FIGS. 8B-1, -2, and -3--Accumulation profiles of CAT (Neumann et al., 1987) and the modified BtICP (Engvall and Pesce, 1978) in a typical transient expression assay.

Unless otherwise stated in the Examples, all procedures for making and manipulating recombinant DNA are carried out by the standardized procedures described in Sambrook et al., Molecular Cloning--A laboratory Manual, Cold Spring Harbor Laboratory (1989).

DESCRIPTION OF THE PREFERRED EMBODIMENTS Example 1. Determination of the Efficiency of Transcription Initiation

The relative efficiency of RNA polymerase II complexes to initiate transcription at chimaeric BtICP plant genes was studied, using transgenic plant N28-220 which is described by Vaeck et al. (1987) and contains copies of the T-DNA of plasmid pGSH163. This T-DNA carries the chimaeric plant genes P_(TR2) bt8843'g7 and P_(TR1') neo3'ocs. Nuclei of 25 g of induced leaves of N28-220 were prepared according to Cox and Goldberg (1988) and stored the nuclei at a temperature of -70° C. This method causes the nascent precursor mRNA chains and the RNA polymerase II complexes to halt while the complexes remain associated at the DNA. A batch of these nuclei was assayed for the ability to incorporate radioactively labeled UTP as a measure for the transcriptional viability of the nuclei (Cox and Goldberg (1988). This incorporation could be successfully repressed by addition of α-amanitin to a final concentration of 2 μg/ml. This shows that the UTP incorporation was due to transcript elongation by RNA polymerass II and that RNA synthesis on the protein coding genes which are occupied by RNA polymerase II can be reactivated under the appropriate experimental conditions.

Batches of the nuclei of N28-220 were used to synthesize radioactively labeled RNA as described by Cox and Goldberg (1988). The radioactive RNA synthesized is a direct representation of the distribution of the RNA polymerases II complexes on the DNA in the nuclei. As the DNA of N28-220 carries two genes which can be assayed, namely the chimaeric neo gene and the chimaeric Bt ICP gene, it is possible to compare the distribution of RNA polymerass II complexes on these two genes. To this end, the radioactive RNA was extracted from the nuclei according to Cox and Goldberg (1988) and used as a probe in a conventional Southern hybridisation. The Southern blot contained DNA fragments carrying the Bt ICP and neo coding sequences in a molar excess relative to the neo and Bt ICP RNA species present in the radioactive probe. A detailed description of the Southern blot is given in FIG. 1. The hybridisation experiment resulted in hybridisation signals to both the neo and Bt ICP coding sequences (FIG. 1). Densitometric scanning showed that the intensity of the hybridisation signal to the neo and Bt ICP coding regions was nearly identical. This result implies that the number of transcripts initiating from the TR dual promoter is about similar in both directions. As in plant N28-220 the cytoplasmic mRNA level is several magnitudes higher than that of Bt ICP; this shows that the Bt ICP coding sequence indeed negatively controls accumulation of cytoplasmic Bt ICP mRNA, but that this phenomenon is not due to a dominant negative effect on transcription initiation of the chimaeric Bt ICP plant gene.

Example 2. Transcription Elongation

The relative distribution of RNA polymerase II complexes on the Bt ICP plant genes present in transgenic plant N28-220 which is described by Vaeck et al. (1987) was investigated. To this end, a second experiment was carried out with batches of the nuclei of N28-220 described in Example 1.

The nuclei were incubated as described by Cox and Goldberg (1988) to synthesize radioactively labeled RNA. The radioactive RNA was extracted as described previously to provide a probe for a Southern hybridisation. The Southern blot prepared for this experiment contained several fragments of the Bt ICP coding sequence in molar excess relative to the complementary RNA present in the probe. The rationale of the experiment was that if the RNA polymerase II complexes were equally distributed over the Bt ICP coding region, the hybridisation with the different Bt ICP DNA fragments present on the Southern blot would be proportional to the size and dATP content of the different fragments. A detailed description of the DNA fragments present on the Southern is given in FIG. 2a. The hybridisation of the radioactive RNA extracted from the nuclei of N28-220 with the Southern revealed that the complete Bt ICP coding sequence as present in N28-220 is transcribed by RNA polymerase II.

Quantification of the hybridisation signals by densitometric scanning of the autoradiogram showed that more radioactively labeled RNA was hybridising with DNA fragments representing Bt ICP sequences located 5' on the Bt ICP coding sequence than with Bt ICP sequences located 3' on the Bt ICP coding sequence. The actual values are given in FIG. 2a. This in vitro experiment demonstrates that in vivo the RNA polymerases are not evenly distributed over the Bt ICP coding sequence.

The site(s) involved in reducing the RNA polymerase II elongation were then determined more accurately. Nine M13 derivatives were made that carry overlapping fragments of the Bt2 coding sequence spanning the region from the AUG to 1584 nucleotides downstream. The inserts were oriented into the vector such that, in single stranded M13 derivatives, the Bt sequences were complementary to the Bt transcript. A schematic view of the M13 clones is given in FIG. 2b.

A molar excess of each single stranded anti-Bt DNA was bound to nylon filters to serve as a DNA target for hybridisation with labeled RNA prepared from nuclear run-on assays with N28-220 nuclei as described by Cox and Goldberg (1988). Three nuclear run-one that differed only in their time period of incubation were carried out simultaneously. The incubation time determines the length of extension of the nascent mRNA chain. Shorter incubation periods give a more accurate view of the position of the RNA polymerase II complexes relative to the substrate DNA and their ability to elongate at the moment of the start of incubation. Hence, the shorter the in yitro incubation period, the more accurate the view in predicting the in vivo situation.

The results are shown in FIG. 2c. The data for the 5 minute incubation show that, in vivo, at a very discrete inhibitory zone along the bt2 coding sequence, one or more factors interfere with transcript elongation and that such factor(s) remain present in such inhibitory zone during the course of the in vitro mRNA extension reaction. Increased incubation periods show that, on a subset of DNA templates, RNA synthesis resumed downstream of such inhibitory zone in this assay without significantly removing the inhibition in the inhibitory zone itself. In this regard, the data indicate that:

1. The inhibitory zone causes the RNA polymerases to pause and not to terminate.

2. This pause is only transitory for a small fraction of the Bt DNA templates which were used.

3. The continued RNA polymerase elongation, downstream of the inhibitory zone, is done by a large number of polymerases on the relatively small fraction of the Bt DNA templates.

It is believed, therefore, that low cytoplasmic Bt mRNA levels are due at least in part to inefficient production of precursor mRNA caused by inefficient elongation of a nascent transcript and/or stalling of RNA polymerase II complexes from transcribing at an inhibitory zone.

The inhibitory zone was assayed for its ability to interact with proteins present in nuclei of tobacco protoplasts. A crude nuclear extract was prepared from tobacco SR1 leaf protoplasts according to Luthe and Quatrano (1980) and used for filter binding assay essentially as described by Diffley and Stillman (1986). 100 ng samples of protein extract were mixed with different amounts of radioactively labeled 532 bp XbaI-AccI bt884 DNA fragment, ranging from 0 to 1670 picomolar, in a final volume of 0.150 ml binding buffer (10 mM Tris pH 7.5, 50 mM NaCl, 1 mM DTT, 1 mM EDTA and 5% glycerol). After 45 minutes incubation at room temperature, the samples were filtered through an alkali-washed nitrocellulose membrane and washed twice with 0.150 ml of an ice-cold solution containing 10 mM Tris pH 7.5, 50 mM NaCl and 1 mM EDTA. The retention of DNA-protein complex was quantified by scintillation counting and revealed that the binding had a dissociation constant in the 100 picomolar range. The binding was not affected by preincubation of the nuclear extract with a molecular excess of a specific competitor DNA.

Example 3. Construction of Deletion Mutants

The previous two examples demonstrate that the Bt ICP coding sequence in a chimaeric plant gene negatively affects the cytoplasmic Bt ICP mRNA level directed by the chimaeric plant gene. It is shown that this negative control is not at the level of transcription initiation but at least in part due to a reduced ability of RNA polymerass II to generate precursor Bt ICP mRNA. A deletion analysis of the chimaeric Bt ICP plant gene was performed to identify whether impaired transcription elongation is the exclusive mechanism by which the Bt ICP sequence interferes with gene expression. The rationale of the experiment is that the introduction of specific deletions in the Bt ICP coding region could remove or inactivate the sequence element(s) responsible for the negative control. As a result such mutant gene would direct an increased level of cytoplasmic mRNA. This method can therefore be used to map and identify the sequence(s) involved in the negative control.

To perform this analysis, a deletion series of the bt860-neo gene (Vaeck et al., 1987) was made. FIG. 3 gives a schematic representation. The resultant deletion derivatives do not specify a Bt ICP and therefore are assayed at RNA level only. In order to obtain accurate Bt ICP mRNA concentration values, the deletion mutants were compared in a transient expression system using tobacco leaf protoplasts of SR1 (Cornelissen and Vandewiele, 1989). The relative mRNA abundances were calculated using a correction factor provided by the mRNA level specified by the cat reference gene present on the same plasmid as the mutant Bt ICP gene. Four hours after introduction of the genes the tobacco leaf protoplasts were harvested, and total RNA was prepared and analysed (FIG. 4).

The mutants nos. 50-60 (FIG. 3) show that progressive deletions of the carboxy-terminal part or the amino-terminal part of the Bt ICP coding sequence result in a gradually increasing neo transcript level. As there are not very abrupt changes in transcript levels, these results suggest that the low transcript level produced by full length Bt ICP genes is controlled by a number of signals. Deletions within the Bt ICP coding sequence indeed did not localise a specific sequence element which, by itself, is responsible for the low Bt ICP mRNA level. Similarly, cloning of fragments of the Bt ICP coding sequence in pJD50 (FIG. 3) did not allow identification of such a region.

The relative transcript levels were plotted against the length of the Bt ICP sequence present in the different deletion derivatives. FIG. 5 suggests that hybrid Bt ICP-neo transcript levels drop with increasing length of the Bt ICP sequence. In this respect, the mutants nos. 61-66 (FIG. 3) form an exception as they show in average a low transcript level relative to the length of the Bt ICP sequence.

These results show that the low transcript levels of Bt ICP plant genes in tobacco are not exclusively due to an impaired elongation of the nascent transcript but that a number of signals operate to cause a reduced expression capacity of the chimaeric Bt ICP gene.

Example 4.

To determine whether cytoplasmic events are important in causing inefficient expression of the bt2 gene in plants, the following test was carried out. Cytoplasmic bt2 mRNA steady state levels in transgenic leaf protoplasts of N28-220 are normally found to be below 1 transcript per cell. The steady state level is determined by, and is proportional to, the number of bt2 transcripts entering per time unit the cytoplasm and the cytoplasmic half-life of the transcript. When steady state levels are achieved, the absolute numbers of transcripts entering and leaving the cytoplasmic bt2 mRNA pool are equal. Therefore, the cytoplasmic half-life and cytoplasmic steady state level of the bt2 transcript will reveal whether its cytoplasmic steady state level is due to a relatively low import of bt2 transcript, a relatively high turnover (i.e., conversion to a protein) rate, or a combination of both.

The cytoplasmic turnover of bt884 transcripts was determined according to Gallie et al. (1989). A capped and polyadenylated synthetic bt884 mRNA was produced in vitro according to protocols of Promega Corporation (Madison, Wis., U.S.A.) and introduced into tobacco leaf protoplasts simultaneously with a synthetic bar (De Block et al., 1987) mRNA. The two synthetic transcripts differed only in their coding sequences. At various times after RNA delivery, samples were taken, and total RNA was isolated. Northern analyses revealed that the half-lives (T 1/2) of the synthetic bt884 and bar transcripts were about 8±3 hours and 5±2 hours, respectively. See Table 1, below. These data show that the bt884 coding sequence, more particularly the bt884 codon usage and the AU-rich motifs in the bt884 coding sequence, do not render the bt884 mRNA more unstable than the bar mRNA which is known to accumulate in the cytoplasm to about 1000 transcripts per tobacco leaf protoplast (calculated from Cornelissen, 1989). The low cytoplasmic steady state level of the bt884 transcripts is, therefore, caused by a lack of import of transcripts into the cytoplasm. Thus, the expression defect of the bt884 gene has to be restored by introduction of modifications in the bt884 coding sequence that improve the expression pathway in the nucleus.

Expression of the bt14, bt15 and bt18 genes in tobacco revealed that these genes also direct low cytoplasmic mRNA steady state levels. Therefore, a similar analysis was carried out with synthetic bt14, bt15 and bt18 transcripts to identify whether the expression defect had a cytoplasmic or nuclear character. Table 1, below, shows that all three transcripts behave as stable mRNAs in the cytoplasm of tobacco leaf protoplasts. Therefore, bt14, bt15 and bt18 genes, like the bt884 gene, must be deficient in exporting high levels of bt transcript to the cytoplasm, and to improve the expression of such genes, it is necessary to modify their coding sequences so that nuclear events, which interfere with efficient gene expression, are avoided or ameliorated.

                  TABLE 1                                                          ______________________________________                                         Half-life determination of synthetic bt and bar mRNAs                          in Nicotiana tabacum cv. Petite Havanna SR1 leaf                               protoplasts                                                                                     T 1/2              T 1/2                                      Example                                                                               1.sup.st mRNA                                                                            (Hours)    2.sup.nd mRNA                                                                          (Hours)                                    ______________________________________                                         A      bt884     8+/-3      bar     5+/-2                                      B      bt14      7+/-2      bar     6+/-3                                      C      bt15      12+/-5     bar     21+/-12                                    D      bt18      10+/-5     bar     12+/-5                                     ______________________________________                                         Legend                                                                         The synthetic bar transcripts had a length of 783 bases                        and included a cap, the TMV leader (77 bases, Danthinne                        and Van Emmelo, 1990), the bar coding sequence (552                            bases; De Block et al., 1987), a trailer of 52                                 nucleotides consisting of the bases                                            [SEQ. ID. NO: 1] GAUCA CGCGA AUU and                                           39 bases from the pGEM-3Z (Promega) polylinker (KpnI                           (T4 DNA pol.)-HindIII (T4 DNA pol.) , and a poly(A) of                         the composition (A).sub.33 G(A).sub.32 G(A).sub.32, followed by the            nucleotides GCU.                                                               The synthetic bt884 transcripts had a length of 2066                           bases and included a cap, the TMV leader (77 bases),                           the bt884 coding sequence followed by the trailer until                        the Klenow treated PstI site (1843 nucleotides), the                           trailer continued with [SEQ. ID. NO: 2] AAUUC CGGGG                            AUCAA UU, 39 bases of the pGEM-3Z polylinker                                   and the (A).sub.33 G(A).sub.32 G(A).sub.21                                     poly(A), followed by the nucleotides CG.                                       The synthetic bt14 transcripts had a length of 2289                            bases and included a cap, the TMV leader (77 bases),                           the bt14 coding sequence till the Klenow treated BclI                          site (2023 bases), plus 26 supplementary nucleotides                           [SEQ. ID. NO: 3] CG                                                            UCG ACC UGC AGC CAA GCU UGC UGA, a trailer starting                            with [SEQ. ID. NO: 4] UUGAU UGACC GGAUC CGGCU                                  CUAGA AUU, followed by 39                                                      bases of the pGEM-3Z polylinker, and the                                       (A).sub.33 G(A).sub.32 G(A).sub.21 poly(A), followed by the                    nucleotides [SEQ. ID. NO: 5] CGGUA CCC.                                        The synthetic bt15 transcripts had a length of 2198                            bases and included a cap, the TMV leader (77 bases) the                        bt15 coding sequence as in pVE35 (PCT publication                              WO90/15139) followed by the trailer till the Klenow                            treated BamHI site (1989 bases), the trailer then                              continued with AAUU, 39 bases of the pGEM-3Z polylinker                        and the (A).sub.33 G(A).sub.32 G(A).sub.21 poly(A), followed by the            nucleotides CG.                                                                The synthetic bt18 transcripts had a length of 2184                            bases and included a cap, the TMV leader (77 bases) the                        bt18 coding sequence until the Klenow treated BcLI site                        (1918 bases), followed by 26 nucleotides until the                             translation stop [SEQ. ID. NO: 3] CG UCG ACC UGC AGC                           CAA GCU UGC UGA, a trailer starting with [SEQ. ID. NO: 4]                      UUGAU UGACC GGAUC GAUCC GGCUC                                                  AGAUC AAUU, 39 bases of the pGEM-3Z polylinker and the                         (A).sub.33 G(A).sub.32 G(A).sub.21 poly(A), followed by the nucleotides        CG.                                                                       

Example 5. Construction of Modified Bt ICP Genes

Examples 1-4 show that the expression in a plant of a Bt ICP gene is negatively affected by the Bt ICP coding sequence at both transcriptional and post-transcriptional levels, but principally by nuclear events. These examples also show that the control of expression is not confined to a specific DNA sequence within the Bt ICP coding sequence. Instead, the negative effect on gene expression is an intrinsic property of the Bt ICP coding sequence. On this basis, it is believed that, by directed change of the DNA sequence of the Bt ICP coding region, an improvement of gene expression will occur. The improvement will be of a cumulative type as the negative influence of the Bt ICP coding region is spread over the complete coding sequence. Similarly, an improvement of gene expression will be obtained by reduction of the length of the Bt ICP coding sequence. This improvement will have a cumulative effect if used in combination with modifications of the Bt ICP coding region.

Therefore, two types of modifications were introduced into a Bt ICP (i.e., bt2) coding sequence which, as will be shown, indeed resulted in a significant increase in Bt ICP plant gene expression. First, the DNA sequence was modified in the central region of the toxic core fragment of the Bt ICP as transcription elongation is impaired in this region. Secondly, the length of the Bt ICP coding sequence was reduced as the negative influence is proportional to the length of the Bt ICP coding sequence. A detailed description of the mutations is given in FIGS. 6a, b and c. As shown in FIG. 7, the modifications change the AT-content of the chimaeric Bt ICP gene significantly. The modifications change the primary DNA structure of the Bt ICP coding sequence without affecting the amino acid sequence of the encoded protein. It is evident that, if more DNA mutations were to be introduced into the Bt ICP coding sequence, a further improvement of gene expression would be obtained.

To determine the effect of the modifications, the expression properties of the modified BtICP gene and the parental bt860-neo gene were compared in a transient expression system as described by Cornelissen and Vandewiele (1989) and Denecke et al. (1989). Basically, the accumulation profiles of the genes under study were compared by relating their profiles to the profile of a reference gene present in the same experiment. FIG. 8a shows the vectors used in the assay, and FIG. 8b shows that the accumulation of the reference CAT protein is nearly identical in both experiments. It is not possible to measure the accumulation of Bt ICP encoded by the parental bt860-neo gene, but the modified Bt ICP gene clearly directs an increased synthesis of Bt ICP.

These results demonstrate that mutation of the Bt ICP coding sequence relieves the negative influence of the Bt ICP coding sequence on the expression of a Bt ICP plant gene.

Example 6. Cloning and Expression of Modified BT ICP Genes in Tobacco and Potato Plants

Using the procedures described in U.S. patent application Ser. No. 821,582, filed Jan. 22, 1986, and EPA 86300291.1, EPA 88402115.5 and EPA 89400428.2, the modified Bt ICP (i.e., bt2) genes of FIGS. 6 and 7 are inserted into the intermediate T-DNA vector, pGSH1160 (Deblaere et al., 1988) between the vector's T-DNA terminal border repeat sequences.

To obtain significant expression in plants, the modified Bt ICP genes are placed under the control of the strong TR2' promoter (Velten et al., 1984) and are fused to the transcript 3' end formation and polyadenylation signals of the T-DNA gene 7 (Velten and Schell, 1985).

In addition, the translation initiation context or site are changed in accordance with the Joshi consensus sequence (Joshi, 1987) in order to optimize the translation initiation in plant cells. To this end, an oligo duplex (FIGS. 6a and 6b) is introduced to create the following sequence at translation initiation site: [SEQ ID NO:6] AAAACCATGGCT. In this way, an additional codon (i.e., GCT) coding for alanins is introduced. Additionally, KpnI and BstXI sites are created upstream of the ATG translation initiation codon.

Using standard procedures (Deblaere et al., 1985), the intermediate plant expression vectors, containing the modified BtICP gene, are transferred into the Agrobacterium strain C58Cl Rif^(R) (U.S. patent application Ser. No. 821,582; EPA 86300291.1) carrying the disarmed Ti-plasmid pGV2260 (Vaeck et al., 1987). Selection for spectinomycin resistance yields cointegrated plasmids, consisting of pGV2260 and the respective intermediate plant expression vectors. Each of these recombinant Agrobacterium strains is then used to transform different tobacco plant cells (Nicotiania tabacum) and potato plant cells (Solanum tuberosum) so that the modified Bt ICP genes are contained in, and expressed by, different tobacco and potato plant cells.

The transgenic tobacco plants containing the modified Bt ICP genes are analyzed with an ELISA assay. These plants are characterized by a significant increase in levels of Bt (Bt2) proteins, compared to a transgenic tobacco plant containing a non-modified Bt ICP (bt2) gene.

The insecticidal activity of the expression products of the modified Bt ICP (bt2) genes in leaves of transformed tobacco and potato plants is evaluated by recording the growth rate and mortality of larvae of Tobacco hornworm (Manduca sexta), Tobacco budworm (Heliotis virescens) and potato tubermoth (Phthorimaea operculella) fed on leaves of these two types of plants. These results are compared with the growth rate of larvae fed leaves from tobacco and potato plants transformed with the unmodified or parental Bt ICP (bt2) gene and from untransformed potato and tobacco plants. Toxicity assays are performed as described in EPA 88402115.5 and EPA 86300291.1.

A significantly higher mortality rate is obtained among larvae fed on leaves of transformed plants containing and expressing the modified Bt ICP genes. Tobacco and potato plants containing the modified Bt ICP genes show considerably higher expression levels of Bt ICPs compared to tobacco and potato plants containing the unmodified Bt ICP gene.

The insecticidal activity of three transgenic tobacco plants containing the modified Bt ICP genes is determined against second and third instar larvae of Hellothis virescens. The control plant was not transformed. The results are summarized in Table 2, below.

                  TABLE 2                                                          ______________________________________                                                   % mortality of insects (recorded after                               Plant     5 days)                                                              ______________________________________                                         Control   11                                                                   No. 1     100                                                                  No. 2     88.5                                                                 No. 3     100                                                                  ______________________________________                                    

Needless to say, this invention is not limited to tobacco and potato plants transformed with the modified Bt ICP gene. It includes any plant, such as tomato, alfalfa, sunflowers, corn, cotton, soybean, sugar beets, rapeseed, brassicas and other vegetables, transformed with the modified Bt ICP gene.

Nor is the invention limited to the use of Agrobacterium tumefaciens Ti-plasmids for transforming plant cells with a modified Bt ICP gene. Other known techniques for plant transformation, such as by means of liposomes, by electropotation or by vector systems based on plant viruses or pollen, can be used for transforming monocotyledonons and dicotyledons with such a modified Bt ICP gene.

Nor is the invention limited to the bt2 gene, but rather encompasses all Cry I, Cry II, CryIII and Cry IV Bt ICP genes.

References

Adang et al., Gene 36, 289-300 (1985).

Asselin et al., Oncogene 4, 549-558 (1989).

Barton et al., Plant Physiol. 85, 1103-1109 (1987).

Connely and Manley, Cell 57, 567-571 (1989).

Connely and Manley, Molecular and Cellular Biology, 9, 5254-5259 (1989).

Cornelissen, Nucl. Acids Res. 17, 7203-7209 (1989).

Cornelissen and Vandewiele, Nucl. Acids Res. 17, 833-843 (1989).

Cox and Goldberg, in: Plant Molecular Biology, A practical approach, edited by C. Shaw, IRL Press, pp. 1-35 (1988).

Danthinne and Van Emmelo, Me. Fac. Landbouww. Rijksuniv. Gent 55 (3a), 1037-1045 (1990).

Darnell, Nature 297, 365-371 (1982).

Deblaere et al., Methods in Enzymology 153, 277-292 (1988).

Deblaere et al., Nucleic Acids Research 13, 4777-4787 (1985).

De Block et al., EMBO J. 6, 2513-2518 (1987).

Delannay et al., Bio/Technology 7, 1265-1269 (1989).

Denecke et 81., Meth. Mol. Cel. Biol. 1, 19-27 (1989).

Diffley and Stillman, Mol. and Cell. Biol. 6, 1363-1373 (1986).

Dulmage H. T, "Production of Bacteria for Biological Control of Insects" in "Biological Control in Crop Production" Ed. Paparizas D. C., Osmum Publishers, Totowa, N.Y., U.S.A., pp. 129-141 (1981).

Engler et al., J. Biol. Chem. 263, 12384-12390 (1988).

Engvall and Pesce, Scand. Immunol. Suppl. 7 (1978).

Fischoff et al., Bio/Technology, 5, 807-812 (1987).

Fromm et al., Bio/Technology, 8, 833 et seq. (1990).

Gallie et al., Plant Cell 1, 301-311 (1989).

Gasser and Fraley, Science 244, 1293-1299 (1989).

Gielen et al., EMBP J 3, 835-845 (1984).

Goldberg L. and Margalit J., Mosq. News 37, 355-358 (1977).

Gordon-Kamm et al., The Plant Cell 2, 603 et seq. (1990).

Gould et al., Plant Physiol. 95, 426-434 (1991).

Hoffmann et al., Proc. Natl. Acad. Sci. U.S.A. 85, 7844-7848 (1988).

H ofte et al., Eur. J. Biochem 161, 273-280 (1986).

H ofte H. and Whiteley H. R., Microbiological Reviews 53, 242-255 (1989).

Joshi, (1987) Nucl. Acids Res. 15, 6643-6653.

Krieg et al., Z. Ang. Ent 96, 500-508 (1983).

Luthe and Quatrano, Plant and Physiology 65, 305 (1980).

Mc Pherson et al., Bio/Technology 6, 61-66 (1988).

Meeusen and Warren, Ann. Rev. Entomol. 34, 373-381 (1989).

Miller et al., Molecular and Cellular Biology 9, 5340-5349 (1989).

"The Molecular Biology of the Cell, Second Edition", Eds. Alberts et al., Garland Publishing Co., N.Y. and London (1989).

Neumann et al., Biotechniques 5, 144 (1987).

Odell et al., Nature 313, 810-812 (1985).

Schnepf et al., J. Biol. Chem. 260, 6264-6272 (1985).

Vaeck et al., Nature 327, 33-37 (1987).

Velten and Schell, Nucleic Acids Research 13, 6981-6998 (1985).

Velten et al., EMBO J 3, 2723-2730 (1984).

Widner and Whiteley, J. Bacterial 171, 965-974 (1989).

Yanisch-Perron et al., Gene 33, 103-109 (1985).

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 23                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 13 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GAUCACGCGAAUU13                                                                (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        AAUUCCGGGGAUCAAUU17                                                            (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 26 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CGUCGACCUGCAGCCAAGCUUGCUGA26                                                   (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        UUGAUUGACCGGAUCCGGCUCUAGAAUU28                                                 (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 8 base pairs                                                       (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CGGUACCC8                                                                      (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 12 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        AAAACCATGGCT12                                                                 (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 48 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        GTACCAAAACCATGGCTATCGAGACCGGTTACACCCCAATCGATATCG48                             (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 48 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        ATCGATTGGGGTGTAACCGGTCTCGATAGCCATGGTTTTGGTACCGAT48                             (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 48 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        ATCGGTACCAAAACCATGGCTATCGAGACCGGTTACACCCCAATCGAT48                             (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 52 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 16..51                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       ATCGGTACCAAAACCATGGCTATCGAGACCGGTTACACCCCAATCGATATC51                          MetAlaIleGluThrGlyTyrThrProIleAspIle                                           1510                                                                           G52                                                                            (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 65 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       GATCCTCTAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGT60                 TAGAC65                                                                        (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 71 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       ATCGTGTCCCTATTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAG60                 TTAACCAGGGA71                                                                  (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 65 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       GATCTACACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGG60                 CATCG65                                                                        (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 65 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       AGGGAAGCATCAGGAGCCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGG60                 ACGCT65                                                                        (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 73 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       CACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCG60                 GGGCCAGAATTCG73                                                                (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 66 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       GATCCGAATTCTGGCCCCGAGAACCCCACAGGGGAAGCCATGATCTGGTGCCCGGACCAG60                 TAGTAC66                                                                       (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 65 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       TCTCCCCTGTGAGCGTCCGTGTAGATGGTGATGCTGTTAAGGATGTCCATCAAGTGTGGG60                 CTCCT65                                                                        (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 64 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       GATGCTTCCCTCGATGCCCTGAGCCGAGCCTCGGAAGCTACCGTCGAAGTTCTCTAACAC60                 TGGG64                                                                         (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 71 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       TTGGTGTAGATCTCCCTGGTTAACTGGGACACGGTTCGGATTGGGTACGTCCTGCTGTCG60                 TAGTTCGGGAA71                                                                  (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 73 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       TAGGGACACGATGTCTAACACGGTTAGGGTTAACTCCCTCCTGAACTGGTTGTACCTGAT60                 CCAGTCTCTAGAG73                                                                (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 343 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 3..341                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       GATCCTCTAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTA47                              SerSerArgAspTrpIleArgTyrAsnGlnPheArgArgGluLeu                                  151015                                                                         ACCCTAACCGTGTTAGACATCGTGTCCCTATTCCCGAACTACGACAGC95                             ThrLeuThrValLeuAspIleValSerLeuPheProAsnTyrAspSer                               202530                                                                         AGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGAGATCTAC143                            ArgThrTyrProIleArgThrValSerGlnLeuThrArgGluIleTyr                               354045                                                                         ACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCT191                            ThrAsnProValLeuGluAsnPheAspGlySerPheArgGlySerAla                               505560                                                                         CAGGGCATCGAGGGAAGCATCAGGAGCCCACACTTGATGGACATCCTT239                            GlnGlyIleGluGlySerIleArgSerProHisLeuMetAspIleLeu                               657075                                                                         AACAGCATCACCATCTACACGGACGCTCACAGGGGAGAGTACTACTGG287                            AsnSerIleThrIleTyrThrAspAlaHisArgGlyGluTyrTyrTrp                               80859095                                                                       TCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAA335                            SerGlyHisGlnIleMetAlaSerProValGlyPheSerGlyProGlu                               100105110                                                                      TTCGGATC343                                                                    PheGly                                                                         (2) INFORMATION FOR SEQ ID NO:22:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3201 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: misc.sub.-- feature                                              (B) LOCATION: 2151..2155                                                       (D) OTHER INFORMATION: /note= "Nucleotides 2151-2155                           wherein N is not known."                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                       AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTA60                 GCATCTGAATTTCATAACCAATCTCGATACACCAAATCGATGGATCCCGATAACAATCCG120                AACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAAGTAGAAGTATTAGGT180                GGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTGTCGCTAACGCAATTT240                CTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTAGTTGATATAATATGG300                GGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATTGAACAGTTAATTAAC360                CAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAGGACTAAGCAAT420                CTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCA480                TTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATT540                CCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCA600                AATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAAAGGTGGGGATTTGAT660                GCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTATTGGCAACTATACAGAT720                CATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGACCGGATTCTAGAGAT780                TGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTATTAGATATCGTTTCT840                CTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTTTCCCAATTAACAAGA900                GAAATTTATACAAACCCAGTATTAGAAAATTTTGATGGTAGTTTTCGAGGCTCGGCTCAG960                GGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTTAACAGTATAACCATC1020               TATACGGATGCTCATAGAGGAGAATATTATTGGTCAGGGCATCAAATAATGGCTTCTCCT1080               GTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATGGAACTATGGGAAATGCAGCT1140               CCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGAACATTATCGTCCACT1200               TTATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGG1260               ACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTATACAGAAAAAGCGGA1320               ACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGA1380               TTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTTCAGGCTTTAGTAATAGTAGTGTA1440               AGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCTGAATTTAATAATATA1500               ATTCCTTCATCACAAATTACACAAATACCTTTAACAAAATCTACTAATCTTGGCTCTGGA1560               ACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTCTTCGAAGAACTTCACCT1620               GGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTATCACAAAGATATCGGGTA1680               AGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCT1740               ATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGC1800               TTTAGGACTGTAGGTTTTACTACTCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACG1860               TTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAGATCGAATTGAATTTGTT1920               CCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAGCACAAAAGGCGGTGAAT1980               GAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGATTATCATATT2040               GATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAA2100               GAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAANNNNNCCTCG2160               AGCTTGGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTAT2220               GACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAG2280               GGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGAC2340               GAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGAC2400               GTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTC2460               CTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG2520               CTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAG2580               CGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCAT2640               CAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAG2700               GATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGC2760               TTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCG2820               TTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTG2880               CTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAG2940               TTCTTCTGACAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACAC3000               ATAATATCGCACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTG3060               AAATATTTCTGAATTTAAACTTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATAC3120               CTGACTTGTTATTTTATCAATAAATATTTAAACTATATTTCTTTCAAGATGGGAATTAAC3180               ATCTACAAATTGCCTTTTCTT3201                                                      (2) INFORMATION FOR SEQ ID NO:23:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3200 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: misc.sub.-- feature                                              (B) LOCATION: 2078..2082                                                       (D) OTHER INFORMATION: /note= "Nucleotides 2078-2082                           wherein N is not known."                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                       AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTA60                 GCATCTGAATTTCATAACCAATCTCGATACACCAAATCGGTACCAAAACCATGGCTATCG120                AGACCGGTTACACCCCAATCGATATTTCCTTGTCGCTAACGCAATTTCTTTTGAGTGAAT180                TTGTTCCCGGTGCTGGATTTGTGTTAGGACTAGTTGATATAATATGGGGAATTTTTGGTC240                CCTCTCAATGGGACGCATTTCTTGTACAAATTGAACAGTTAATTAACCAAAGAATAGAAG300                AATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAGGACTAAGCAATCTTTATCAAATTT360                ACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCATTAAGAGAAGAGA420                TGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATTCCTCTTTTTGCAG480                TTCAAAATTATCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATTTACATTTAT540                CAGTTTTGAGAGATGTTTCAGTGTTTGGACAAAGGTGGGGATTTGATGCCGCGACTATCA600                ATAGTCGTTATAATGATTTAACTAGGCTTATTGGCAACTATACAGATCATGCTGTACGCT660                GGTACAATACGGGATTAGAGCGTGTATGGGGACCGGATTCTAGAGACTGGATCAGGTACA720                ACCAGTTCAGGAGGGAGTTAACCCTAACCGTGTTAGACATCGTGTCCCTATTCCCGAACT780                ACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGAGATCTACACCA840                ACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCGAGGGAA900                GCATCAGGAGCCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCTC960                ACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGG1020               GGCCAGAATTCACTTTTCCGCTATATGGAACTATGGGAAATGCAGCTCCACAACAACGTA1080               TTGTTGCTCAACTAGGTCAGGGCGTGTATAGAACATTATCGTCCACTTTATATAGAAGAC1140               CTTTTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGGACAGAATTTGCTT1200               ATGGAACCTCCTCAAATTTGCCATCCGCTGTATACAGAAAAAGCGGAACGGTAGATTCGC1260               TGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGATTTAGTCATCGAT1320               TAAGCCATGTTTCAATGTTTCGTTCAGGCTTTAGTAATAGTAGTGTAAGTATAATAAGAG1380               CTCCTATGTTCTCTTGGATACATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCAC1440               AAATTACACAAATACCTTTAACAAAATCTACTAATCTTGGCTCTGGAACTTCTGTCGTTA1500               AAGGACCAGGATTTACAGGAGGAGATATTCTTCGAAGAACTTCACCTGGCCAGATTTCAA1560               CCTTAAGAGTAAATATTACTGCACCATTATCACAAAGATATCGGGTAAGAATTCGCTACG1620               CTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGGGGA1680               ATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACTGTAG1740               GTTTTACTACTCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACGTTAAGTGCTCATG1800               TCTTCAATTCAGGCAATGAAGTTTATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAA1860               CCTTTGAGGCAGAATATGATTTAGAAAGAGCACAAAAGGCGGTGAATGAGCTGTTTACTT1920               CTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGATTATCATATTGATCAAGTATCCA1980               ATTTAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAAGAATTGTCCGAGA2040               AAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAANNNNNCCTCGAGCTTGGATGGAT2100               TGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAAC2160               AGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTC2220               TTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGC2280               TATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAG2340               CGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACC2400               TTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTG2460               ATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTC2520               GGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGC2580               CAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGA2640               CCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCA2700               TCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTG2760               ATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCG2820               CCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGACAGA2880               TCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGCACT2940               CAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAA3000               TTTAAACTTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATT3060               TTATCAATAAATATTTAAACTATATTTCTTTCAAGATGGGAATTAACATCTACAAATTGC3120               CTTTTCTTATCGACCATGTACGGGTACCGAGCTCGAATTCCACGCAGCAGGTCTCATCAA3180               GACGATCTACCCGAGTAACA3200                                                       __________________________________________________________________________ 

We claim:
 1. A modified DNA sequence encoding a Bacillus thuringiensis Bt2 insecticidal crystal protein (ICP) or an insecticidally effective fragment thereof, which comprises the DNA sequence of SEQ ID No. 23 from nucleotide position 705 to nucleotide position
 1016. 2. The modified DNA of claim 1, wherein said DNA encodes a Bt2 ICP which is truncated at its N- and/or C-terminal end towards a trypsin cleavage site.
 3. The modified DNA of claim 1, wherein said DNA is further modified by substituting for its ATG translation initiation site: AAAACCATGGCT.
 4. The modified DNA of claim 1, which comprises the coding region of SEQ ID No. 23 from nucleotide position 111 to nucleotide position
 2077. 5. The modified DNA of claim 4, wherein said coding region is truncated at its N- and/or C-terminal end so that it still encodes an insecticidally effective Bt2 protein.
 6. A modified DNA sequence encoding a Bt2 protein or an insecticidally effective part thereof, wherein A and T nucleotides are changed to G and C nucleotides in about 63 codons of the DNA sequence corresponding to the DNA sequence from nucleotide 778 to nucleotide position 1089 of SEQ ID No.
 22. 7. The modified DNA of claim 3, wherein the native Bt2 codons 2 to 28 are deleted and the native Bt2 codons 29 to 31 are changed by replacing A and T nucleotides to corresponding G and C nucleotides encoding the same amino acids.
 8. The modified DNA of claim 1, wherein A and T nucleotides are changed to G and C nucleotides in about 3 codons at the translation initiation site.
 9. A chimeric gene for transforming a cell of a plant, comprising the following operably-linked DNA fragments in the same transcriptional unit:(a) the modified DNA sequence of claim 1 or claim 6; (b) a promoter capable of directing transcription of the modified DNA in a plant cell; and (c) transcription 3' end formation and polyadenylation signals for expressing the modified DNA sequence in the plant cell.
 10. A chimeric gene for transforming a cell of a plant, comprising the following operably-linked DNA fragments in the same transcriptional unit:(a) the modified DNA sequence of claim 7; (b) a promoter capable of directing transcription of the modified DNA in a plant cell; and (c) transcription 3' end formation and polyadenylation signals for expressing the modified DNA sequence in the plant cell.
 11. A plant genome, comprising the chimaeric gene of claim
 10. 12. A plant cell, transformed with the chimaeric gene of claim
 10. 13. A plant, comprising the chimaeric gene of claim 10 stably integrated into its genome. 